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Abstract 

Local convergence analysis of the proximal point method for special class of nonconvex func- 
tion on Hadamard manifold is presented in this paper. The well definedness of the sequence 
generated by the proximal point method is guaranteed. Moreover, is proved that each clus- 
ter point of this sequence satisfies the necessary optimality conditions and, under additional 
assumptions, its convergence for a minimizer is obtained. 

Key words: proximal point method, nonconvex functions, Hadamard manifolds. 

1 Introduction 

The extension of the concepts and techniques of the Mathematical Programming of the Euclidean 
space R n to Riemannian manifolds is natural. It has been frequently done in recent years, with 
a theoretical purpose and also to obtain effective algorithms; see p~], [3], [TO], [TT], [H], pj5], [2D] . 
|23j and [25]. In particular, we observe that, these extensions allow the solving some nonconvex 
constrained problems in Euclidean space. More precisely, nonconvex problems in the classic sense 
may become convex with the introduction of an adequate Riemannian metric on the manifold (see, 
for example [9]). The proximal point algorithm, introduced by Martinet [17] and Rockafellar |21j . 
has been extended to different contexts, see |11] , [19] and their references. In [11] the authors 
generalized the proximal point method for solve convex optimization problems of the form 

(P) min/(p) , . 

s.t. p G M, 1 ' 
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where M is a Hadamard manifold and / : M — > R is a convex function (in the Riemannian sense). 
The method was described as follows: 



with p° £ M an arbitrary point, d the intrinsic Riemannian distance (to be defined later on) and 
{Afc} a sequence of positive numbers. The authors also showed that this extension is natural. 
As regards to |19j the authors generalized the proximal point method with Bregman distance to 
solve quasiconvex and convex optimization problems also on Hadamard manifold. Spingarn in [23] 
has, in particular, developed the proximal point method for the minimization of a certain class of 
nondifferentiable noncovex functions, namely, the lower-C 2 functions defined in Euclidean spaces, 
see also [13J . Kaplan and Tichatschke in |15j also applied the proximal point method for the 
minimization of a similar class of the ones of [13] and [23] , namely, functions defined as maximum 
of a certain collection (finite/infinite) of continuously differentiable functions. In [2] we study, in 
the Riemannian context, the same class of functions studied in [15J. In that context we applied 
the proximal point method ([2]) to solve the problem ([1]), however we assumed that the collection of 
functions defining the objective function was finite. 

Our goal is to extend the results of [2J. We consider that the objective function is given by the 
maximum of a collection infinite of continuously differentiable functions. To obtain the results in 
[2J, it was necessary to study the generalized directional derivative in the Riemannian manifolds 
context. In this paper we go further in the study of properties of the generalized directional 
derivative in order to analyze the convergence of the proximal point method. Several works have 
studied such concepts and presented many useful results in the Riemannian optimization context, 
see for example [3], [32], p2] and [25] . 

The paper is divided as follows. In Section 11.11 we give the notation and some results on 
the Riemannian geometry which we will use along the paper. In Section [2] we recall some facts 
of the convex analysis on Hadamard manifolds. In Section [3] we present definition of generalized 
directional derivative of a locally Lipschitz function (not necessarily convex) which, in the Euclidean 
case, coincides with the Clarke's generalized directional derivative. Moreover, some properties of 
that derivative are presented, amongst which the upper semicontinuity of the directional derivative. 
In Section H] we study the proximal point method ([2]) to solve the problem dU, in the case where the 
objective function is a real-valued function (non necessarily convex) on a Hadamard manifold M 
given by the maximum of a certain class of functions. Finally in Section [5] we provide an example 
where the proximal point method for nonconvex problems is applied. 

1.1 Notation and terminology 



In this section we introduce some fundamental properties and notations on Riemannian geometry. 
These basics facts can be found in any introductory book on Riemannian geometry, such as in [6] 




(2) | E: 1 . 2 
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Let M be a re-dimentional connected manifold. We denote by T„M the n-dimentional tangent 
space of M at p, by TM = U pg j\/T p M tangent bundle of M and by A?(M) the space of smooth 
vector fields over M. When M is endowed with a Riemannian metric ( , ), with the corresponding 
norm denoted by || ||, then M is now a Riemannian manifold. Recall that the metric can be used to 
define the lenght of piecewise smooth curves 7 : [a, b] — > M joining p to q, i.e., such that 7(a) = p 
and 7(6) = q, by 



and, moreover, by minimizing this length functional over the set of all such curves, we obtain a 
Riemannian distance d(p, q) which induces the original topology on M. The metric induces a map 
/ 1 — y grad/ G X(M) which associates to each smooth function on M its gradient via the rule 
(grad/, X) = df(X), X G X(M). Let V be the Levi-Civita connection associated to (M, ( , )). A 
vector field V along 7 is said to be parallel if Vy V = 0. If 7' itself is parallel we say that 7 is a 
geodesic. Given that geodesic equation V y 7' = is a second order nonlinear ordinary differential 
equation, then geodesic 7 = 7« (.,£>) is determined by its position p and velocity v at p. It is easy to 
check that ||7'|| is constant. We say that 7 is normalized if ||7'|| = 1. The restriction of a geodesic 
to a closed bounded interval is called a geodesic segment. A geodesic segment joining p to q in M 
is said to be minimal if its length equals d{p, q) and this geodesic is called a minimizing geodesic. 
If 7 is a curve joining points p and q in M then, for each t 6 [a,b], V induces a linear isometry, 
relative to ( , ), Py a w t ) : Ty a )M — > Ty t )M, the so-called parallel transport along 7 from 7(a) to 
7(t). The inverse map of P^{a)i(t) ^ s denoted by P~A , t , : Ty t )M — > Ty a )M. In the particular case 
of 7 is the unique curve joining points p and q in M then parallel transport along 7 from p to q is 
denoted by P pq : T p M -> T g M. 

A Riemannian manifold is complete if geodesies are defined for any values of t. Hopf-Rinow's 
theorem asserts that if this is the case then any pair of points, say p and q, in M can be joined by 
a (not necessarily unique) minimal geodesic segment. Moreover, (M, d) is a complete metric space 
and bounded and closed subsets are compact. Take p € M. The exponential map exp p : T p M — > M 
is defined by exp p v = j v (l,p). 

We denote by R the curvature tensor defined by R(X,Y) = VjfVyZ — VyVxZ — V[y,x]Z, 
with X, Y, Z 6 X(M), where [X, Y] = YX — XY. Then the sectional curvature with respect to X 



and Y is given by K(X, Y) = {R(X,Y)Y,X)/(\\X\\ 2 \\X\\ 2 - {X , Y) 2 ), where ||X|| = (X,X) 1 / 2 . If 



K(X, Y) ^ for all X and Y, then M is called a Riemannian manifold of nonpositive curvature 
and we use the short notation K ^ 0. 

Theorem 1.1. Let M be a complete, simply connected Riemannian manifold with nonpositive 
sectional curvature. Then M is diffeomorphic to the Euclidean space W 1 , n = dimM. More 
precisely, at any point p G M , the exponential mapping exp p : T p M — >■ M is a diffeomorphism. 




Proof. See [6j and [22]. 



□ 



A complete simply connected Riemannian manifold of nonpositive sectional curvature is called 
a Hadamard manifold. The Theorem 11.11 says that if M is Hadamard manifold, then M has the 
same topology and differential structure of the Euclidean space M n . Furthermore, are known some 
similar geometrical properties of the Euclidean space R n , such as, given two points there exists an 
unique geodesic that joins them. In this paper, all manifolds M are assumed to be Hadamard finite 
dimensional. 
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2 Convexity in Hadamard manifold 

In this section, we introduce some fundamental properties and notations of convex analysis on 
Hadamard manifolds which will be used later. We will see that these properties are similar to those 
obtained in convex analysis on the Euclidean space M. n . References to convex analysis on Euclidean 
space R n are in [14], and on Riemannian manifold are in [7J, [TTJ, |20| . [22], |23| and |25| . 

The set Q C M is said to be convex if any geodesic segment with end points in Q is contained 
in fi. Let fi C M be an open convex set. A function / : M — > R is said to be convex (respectively, 
strictly convex) on fi if for any geodesic segment 7 : [a, b] — > f2 the composition / o 7 : [a,b] — > R is 
convex (respectively, strictly convex). Now, a function / : M — > R is said to be strongly convex on 
fi with constant L > if, for any geodesic segment 7 : [a, b] — > f2, the composition / 07 : [a, b] — )■ R 
is strongly convex with constant L||7'(0)|| 2 . Take p € M. A vector s € T p M is said to be a 
subgradient of / at p if 

f(q)>M + (s,exPr 1 q), 
for any q € M. The set of all subgradients of / at p, denoted by df(p), is called the subdifferential 
of / at p. 

Take p € M . Let exp~ l : M — > T p M be the inverse of the exponential map which is also C°°. 
Note that d(q , p) = Wexp^qW, the map d 2 ( . ,p) : M — > R is C°° and 

graded 2 (q,p) = -exp~ l p, 

(remember that M is a Hadamard manifold) see, for example, |22j . 
Proposition 2.1. Ta/ce p € M. The map d 2 { . ,p)/2 is strongly convex. 

Proof. See [7J. □ 



Definition 2.1. Let £1 C M be an open convex set. A function f : M 
on £1 if there exists a constant L := L(f2) > such that 

\f(p)~ f(q)\<Ld(p,q), p,qett. 



is said to be Lipschitz 



(3) Lipschl 



Moreover, if it is established that for all po € there exists L(j>o) > and 5 = 6(po) > such that 
the inequality (T3|) occurs with L = L(po) for all p,q G B$(po) := {p E Q, : d(p,po) < 5}, then f is 
called locally Lipschitz on £1. 
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Remark 2.1. As an immediate consequence of the triangular inequality we obtain that \d(p,po) — 
d(q,Po)\ < d(p,q) for allp,q and po G M. Then, of the Definition \2.1{ we get that the Riemannian 
distance function to a fixed point, d(-,q) is Lipschitzian and therefore Lipschitzian locally. In fact, 
it well known that every convex function is locally Lipschitz and consequently continuous. See 



Proposition 2.2. Let Vt C M be a open convex set, f : M — > 1R and p G M. If there exists A > 
such that f + (A/2) d 2 (. ,p) : M — > ]R is convex on $7, then f is Lipschitz locally on f2. 

Proof. Because / + (A/2) d 2 (. ,p) is convex, it follows from Remark 12.11 that for any p £ f2 there 
exist L±,Si > such that 

\[f( gi ) + (X/2)d 2 ( gi ,p)} - [f(q 2 ) + (X/2)d 2 (q 2 ,p)}\ < L 1 d(q 1 ,q 2 ), V q u q 2 G B(p,5 x ). (4) 

Moreover, Proposition 12.11 together with Remark 12.11 imply that there exist L 2 ,5 2 > such that 

\(l/2)d 2 (q 1 ,p)-(l/2)d 2 (q 2 ,p)\<L 2 d(q 1 ,q 2 ), V qi, q 2 G B(p, 5\). (5) 

Simples algebraic manipulations implies that 

\f(qi) ~ < \[f(qi) + A/2) d 2 ( Ql ,p)} - [f(q 2 ) + (A/2) d 2 (q 2 ,p)}\ + 

+ \(X/2)d 2 (q 2 ,p)-(X/2)d 2 ( qi ,p)\. 

Therefore, taking 5 = min{#i, ^2}, using ([4]) and ([5]) we conclude from last inequality that 

|/(<7l)-/(<72)| < (L 1 + XL 2 )d(qi,q 2 ), V q\, q 2 G B(p, 5), 

and the proof is finished. □ 

Definition 2.2. Let C M be a open convex set and f : M —> R a continuously differentiable 
function on £1. The gradient vector field grad/ is said to be Lipschitz with constant T > on Q 
always that 

II grad /(g) - P pq gr&d f (p)\\ < Fd(p,q), p,qeft, 
where P pq is the parallel transport along the geodesic segment joining p to q. 



DesLiplOO 
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3 Generalized directional derivatives 

sec : dd 

In this section we present definitions for the generalized directional derivative and subdifferential 
of a locally Lipschitz function (not necessarily convex) which, in the Euclidean case, coincide 
with the Clarke's generalized directional derivative and subdifferential, respectively. Moreover, 
some properties of those concepts are presented, amongst them the upper semicontinuity of the 
directional derivative and a relationship between the subdifferential of a sum of two Lipschitz locally 
function (in the particular case that one of them is differentiable) and its subdifferentials. 
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d: Clarke 



Definition 3.1. Let Q C M be an open convex set and f : M — > R a locally Lipschitz function on 
Q. The generalized directional derivative f° : TQ — > R of f is defined by 



f (exp, t{D exp p ) exp -i q v) - f(q) 



f°(p, v) := lira sup , (6) Clarke 1 

40 q^p t 

where (D exp p ) exp -i q denotes the differential o/exp p at expp 1 q. 

It is worth to pointed out that an equivalently definition has appeared in [4]. 

Remark 3.1. The generalized directional derivative is well defined. Indeed, let L p > the Lipschitz 
constant of f at p and 5 = 5(p) > such that 

\f(exp q t(Dexjp p ) exp -i q v) - f(q)\ < L p d(exp g t(D exp p ) exp -i q v, q), q G B s (p), t G [0,5). 
Since d(exp q tp ex.p p ) exp -i q v, q) = t\\(D exp p ) exp -i q v\\, above inequality becomes 

\f{exp q t{Dexp p ) e ^-i q v) - f(q)\ < L p t\\p ex Vp ) exp -i q v\\, q G B s (p), t G [0,5). 
Since lim (? _ s> p (D exp p ) exp -i q v = v our statement follows from last inequality. 

Remark 3.2. Note that, if M = R n then exp p w =p + w and p exp p ) exp -i q v = v. In this case, 
([U becomes 

^ x r f(q + tv)-f(q) 
J \P, v) = hm sup , 

40 q^p 1 

which is the Clarke's generalized directional derivative, see [5]. Therefore, the generalized differen- 
tial derivative on Hadamard manifold is a natural extension of the Clarke 's generalized differential 
derivative. 

Now we are going to prove the upper semicontinuity of the generalized directional derivative. 



'ropoConverg Proposition 3.1. Let C M be an open convex set and f : M — > R be a locally Lipschitz function. 

Then, f° is upper semicontinuous on TQ, i.e., if (p,v) G TQ and {p k ,v k } is a sequence in TQ such 
that limfc_ i , +00 (p fc , v k ) = (p,v), then 

limsupf°(p k ,v k )<f (p,v). (7) 

k— >+oo 

Proof. Let (p,v) G TQ and {(p k ,v k )} C TQ such that limk^ +00 (p k , v k ) = (p,v). For proving the 
inequality ([7]) first note that for each k 

f°{p k ,v k )< limsup f(^Pq tw ) ~ f(l) ^ ( qiW ) e TQ. 

40 (q,w)->-(p k ,v k ) * 



DerFinita2 
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So, by definition of upper limit, there exists (q ,w 1 ) G T£l — {(p , v )} and ifc > such that 



/°(p*w fc ) 



1 /(exp * t k w k ) - f(q k 



k 



< 



t k 



d{{q*,w%(p",v«))+t k < v 



(8) UpperSeml 



with d being the Riemannian distance in TM. Let U p C f2 be a neighborhood of p such that 
TC/p ~ Up x M n , / is Lipschitz in J7 P with constant L p and exp is Lipschitz on TU P with constant 
X. Prom the first inequality in ([8]), we obtain 



if (ex P(?fc t k (D exp p ) exp -! gfcU J - f(q k ) 



+ 



f(exp qk t k w k ) - f (exp qk t k (Dexjp p ) exp -i qk v 



t k 



. (9) UpperSem3 



On the other hand, as lim k ^. +oa (p , v k ) = (p, v), we conclude from the second inequality in (|8|) that 

exp q fc t k w k G U p , exp q k t k (D exp p ) exp -i qk v) G U p , k> k , 
for ko sufficiently large. Thus, as / is Lipschitz on U p , for k > ko we have 



f(exp qk t k w k ) - f (exp gfc t k (Dexp p ) exp -i qk v^j 



< 



L p d(exp k t k w k ,exp k t k (Dexp) -i k v). (10) I UpperSeml 



Now, taking into account that exp is Lipschitz on TU P , in the particular case that k > ko 



d [exp qk t k w k , exp qk t k (D exp p ) exp -i qk vj < k\\t k w k - t k {D exp p ) exp -i qk v\\. 
Since \\m. k ^ f+00 p k = p, second equation in ([8|) imply that lim k _ ) . +00 q k = p. Consequently, 



lim (Dexp ; 

fc— >+oo 



-1 V = V, 
exp p q k 



which, together with last inequality imply 



lim d exp k t k w , exp k t k (D exp ) -i k v) jt k = 0. 

fc— >-+oo \ V P » / 

Therefore, combining last equation, Q, (|10p . and Definition 13,11 the result follows. 



□ 



Next we generalize the definition of sub differential for locally Lipschitz functions defined on 
Hadamard manifold. 



regular 1 
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Definition 3.2. Let C M be an open convex set and f : M — > M. a locally Lipschitz function on 
fL The generalized subdifferential of f atp e£l, denoted by d°f(p), is defined by 

d°f(p) := {w G T p M : f(p,v) > (w,v),V v G T p M}. 

Remark 3.3. Let 0, C M be an open convex set. If the function f : M — > M is convex on £1, 
then f°(p,v) = f'(p,v) (respectively, d°f(p) = df(p)) for all p G ft, i.e., the directional derivatives 
(respectively, subdifferential) for Lipschitz functions is a generalization of the directional derivatives 
(respectively, subdifferential) for convex functions. See Claim 5.4 in the proof of Theorem 5.3. 

Definition 3.3. Let f : M — > R be locally Lipschitz function. A point p G is said to be a 
stationary point of f always G d°f(p). 

Lemma 3.1. Let C M be an open set. If f : M R is locally Lipschitz function on Q and 
g : M —>M is convex on Q, then 

(f + g)°(p,v) = f°(p,v)+g'(p,v) pen, v G TpM. (11) 

Moreover, if g is differentiate, we have 

d (f + g)(p)=d°f(p) + gr a dg(p), p E Q. (12) 

Proof. Using the definition of the generalized directional derivative and simple algebraic manipu- 
lations, we obtain 



ddg:l 



sdg:l 



(f + 9)°(p,v) 



lim sup 



f(ex.-p q t(Dexp p ) exp -i q v) - f(q) 



+ 



<7(exp 9 t(D exp 



P ; exp p q 



v) -g{q) 



t 

From basic properties of the upper limit along with the definition of directional derivative general- 
ized and Remark 13.31 follows that 



(f + gy(p,v)<f°(p,v)+g'(p,v). 
Now, as f°(p,v) = ((/ + g) + {—g))° (p,v), above inequality implies, in particular 

f°(p,v)<(f + g) (p,v) + (-gy(p,v), 

which is equivalent to 

(f + g)°(p,v) >f°(p,v)+g'(p ; v). 

Thus, combining last inequality with inequality (fl~3j) . the equality (fTTl) is obtained. 

In the case that g is also differentiable, in particular g'(p,v) = (gradg(p),v). Therefore, the 
proof of the equality (|12H is an immediate consequence of the equality (llip along with the definition 
of the generalized subdifferential. □ 



(13) I P : ddgl 
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c:ssfd Corollary 3.1. Let Q C M be a open convex set, f : M R be locally Lipschitz functions on Q, 
p G M and A > such that f + (A/2) d 2 (. ,p) : M — > R is convex on Q. If p G Q, is a minimizer of 
f + {X/2)d 2 {.,p) then 

Aexp^p G d°f(p). 

Proof. Since p is a minimizer of / + (X/2)d 2 (.,p) we obtain 



(14) eq:mcl 



On the other hand, as / + (A/2) d 2 (. ,p) is convex on f2 and (A/2) d 2 (. ,p) is differentiable with 
grad (A/2) d?(q, p) = — Aexp" 1 ^, using Remark 13.31 Proposition 12.21 and applying Lema [37X1 with 
g = (A/2) d 2 {. ,p), we have 



d (f + ^(., p)) (p) =d°(f + ^d 2 (., p)\ (p) = d°f(p) - Aexp-V 



Therefore, the result follows by combining (|14() with (I15p . 



(15) 
□ 



eq:mc2 



4 Proximal Point Method for Nonconvex Problems 

sec5 

In this section we present an application of the proximal point method for minimize a real- valued 
function (non necessarily convex) given by the maximum of a certain class of continuously differ- 
entiable functions. Our goal is to prove the following theorem: 

MPP10 Theorem 4.1. Let Q C M be an open convex set, q G M and T C R a compact set. Let <p : 
M x T — > R be a continuous function on f2 x T such that </?(., t) : M — > R is a continuously 
differentiable function on O ( closure of Q) for all r G T , and f : M — > R defined by 

f(p) : = max TeT (/}(p,r). 

Assume that -co < inf pe j\,/ f(p), grad p y>(., r) is Lipschitz on ft with constant L T for each r G T 
smc/i i/iat sup TgT L r < +oo and 

£/(/(?)) = e M : /(p) < f(q)} C £2, inf /(p) < f(q). 

peM 

Take < A and a sequence {Xk} satisfying sup TgT L T < A^ < A and p G Lf(f(q)). Then the 
proximal point method 

p k+1 := a rgmm peM (f(p) + ^d 2 (p,p k )\, A; = 0,1,..., (16) [¥7T^ 

with starting point p° = p is well defined, the generated sequence {p k } rests in Lt (f(q)) and satisfies 
only one of the following statement 
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re : tconver 
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cor : wdf 



i) {p k } is finite, i.e., p k+1 = p k for some k and, in this case, p k is a stationary point of f , 

ii) {p k } is infinite and, in this case, any cluster point of {p k } is a stationary point of f . 
Moreover, assume that the minimizer set of f is non-empty, i. e., 

hi) U* = {p: f(p) = inf peM f(p)} + 0. 

Let c G (infpgjw /(p), f(q)). If, in addition, the following assumptions hold: 
h2) Lf(c) is convex, f is convex on Lf(c) ; 

h3) for allp G L f (f(q)) \L/(c) and y(p) G d°f(p) we have \\y{p)\\ > 5 > 0, 
then the sequence {p k } generated by (fT6|) with 

k = 0,1,... 



supL T < Afc < A, 



(17) Imppl02 



converge to a point p* 6 [/*. 



Remark 4.1. The continuity of each function </?(., r) on f2 in h2 guarantees that the level sets of 
the fuction f, in particular the solution set U* , are closed in the topology of the manifold M . 

In the next remark we show that if il is bounded and tp(., r) is convex on Q for all r G T then 
/ satisfies the assumptions h2 and h3. 

Remark 4.2. If <p(.,r) is a convex function on 0, for all t G T then the assumtion h2 is naturally 
verified and if hi hold then h3 also holds. For details, see f^jj. 

In order to prove above theorem we need of some preliminary results. From now on we assume 
that every assumptions on Theorem 14.11 hold, with the exception of hi, h2 and h3, which will be 
considered to hold only when explicitly stated. 

Lemma 4.1. For all p G M and A satisfying 

supL T < A, 

function f + (X/2)d 2 (. ,p) is strongly convex on with constant A — sup Tgr L T . 

Proof. Since T is compact and cp is continuous the well definition of / follows. To conclude, see 
Lemma 4.1 in [2]. □ 

Corollary 4.1. The proximal point method (I16D applied to f with starting point p° = p is well 
defined. 
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Proof. Since compactness play no rule, the proof is equal to the proof of Corollary 4.1 in [2]. □ 
mpproxlO Lemma 4.2. Let {p k } be the sequence generated by the proximal point method (|16p. Then 

i) G d (f + ^d 2 {. , p k fj (p k+1 ), k = 0,1, 

ii) lim d(p k+l ,p k ) = 0. 

s— ¥oo 

Moreover, if Xk satisfies (|17p and hi, h2 and h3 hold, then {p k } converges to a point p* € U*. 

Proof. Since compactness play no rule, the proof is similar to the proof of Lemmas 4.2, 4.3 and 4.4 
of [2]. □ 



Proof of Theorem 14.11 

Proof. The well definition of the proximal point method (|16p follows from the Corollary 14.11 Let 
{p k } be the sequence generated by proximal point method. Because p° = p € Lf(f(q)), (fl~6j) implies 
that the whole sequence is in Lf(f(q)). From item i of Lemma 14.21 we have 



OE0 / + 



Xk 



d\,p k ))(p k+1 ), k 



0,1, 



Since sup reT L T < A^, Lemma [4TT1 implies that / + (Xj t /2)d 2 (. ,p k ) is strongly convex on f2, which 
together with Proposition 12.21 give us that / is locally Lipschitz on Q. So, using the definition of 
p k+l , we conclude from Corollary 13.11 with X = X^, p = p k and p = p k+1 that 



Afcexp-^P* £d°f(p k+1 ). 



(18) 



If {p k } is finite, then p k+1 = p k for some k and latter inclusion implies that € d° f(p k+1 ), i.e., 
p k is a stationary point of /. Now assume that {p k } is a infinite sequence. If p is a cluster point 
of {p k }, then there exists a subsequence {p ks } of {p k } such that lim s ^. +00 p k 
Lemma 14.21 implies 

lim <l(p h » 



lim \\expj s+1 p k 



Now, from the relation ()18p . we have 

f°(p ks+ \v)>X ks (ew;l + iP k %v), 
Let v G TpM. Hence, latter inequality implies that 

/V* + V +1 ) > x ks (^v- p Up k %v k ° +l ), 



1 = p and item ii of 
(19) 



Vt<6 T pks+ iM. 



I3 ( ex Pp)expri p ^+i^ 



MPP12 



AcumEstacl 
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Note that lim s _ >+00 p' Cs+1 = p implies lim s _> +00 v ks+1 = v. Because {Xk s } is bounded, letting s goes 
to +00 in the last inequality, Proposition 13.11 together with (fT9|) give us 

f°(p,v) > lim supf°(p ks+1 ,v k ° +1 ) > 0, 

s— ¥+00 

which implies that £ d° f(p), i.e., p is a stationary point of / and the first part of the theorem is 
concluded. 

The second part follows from the last part of Lemma 14.21 and the proof of the theorem is 
finished. □ 



sec6 



5 Example 

Let (R++,(, }) be the Riemannian manifold, where R++ = {x € R : x > 0} and ( , ) is the 
Riemannian metric (u,v) = g(x)uv with g : M++ — > (0, +00). So, the Christoffel symbol and the 
geodesic equation are given by 



. . 1 1, s dq(x) d , / . . 



(i 2 a; . f dx 



0. 



respectively. Besides, in relation to the twice differentiable function /i : 
and the Hessian of /i are given by 

grad /i = g~ l h' hess h = h" — Th', 



I, the Gradient 



respectively, where h! and /i" denote the first and second derivatives of h in the Euclidean sense. 
For more details see [25J. So, in the particular case of g(x) = x~ 2 , 



r(x) = —x , gradh(x) = x h'(x), hess h(x) = h"(x) + x h'(x). 
Moreover, the map if) : R - 



(20) 



and the manifold 



defined by ^(a;) = e^ is an isometry between the Euclidean space 

x R ++ — > R + is given by 



{ , }), and the Riemannian distance d : R++ 
d(x,y) = = \ ln(x/y)\, 



(21) 



Hess : 1 



dRiem: 1 



see, for example [9]. Therefore, (R++,(, )) is a Hadamard manifold and the unique geodesic 
x : R — > R++ with initial conditions x(0) = xq and x'(0) = v is given by 

z(t) = x e (,;/:ro) *. 

Now let /1, /2, / : R++ — > R and 99 : R++ x [0, 1] — >• R be real- valued functions such that 

<p(x,r) = fx(x) +t(f 2 (x) - /i(x)), /(x) = max Te[0jl] ip(x,r), (22) |eq:dfs 
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and consider the problem 

min f(x) 

s.t. x e 

Take a sequence {A&} satisfying < A&. From (j2Tj) . the proximal point method (fTBT) becomes 

:= argmin^^ {/(*) + ^ In 2 ( J-) | , k = 0, 1, ... . 

If /i and /2 are given, respectively, by fi(x) = ln(x) and /^(x) = — ln(x) + e~ 2x — e -2 , then tp is 
continuous and </?(•, t) is continuously differentiable for each r € [0, 1]. The last expression in (|20p 
implies that 

hess/i(a;) = , hess f 2 (x) = (4 - 2/x)e~ 2x , x G (23) 
e, as a consequence, first expression in ([22]) give us 

hess x ip(x,r) = rhess f2(x), Vx G M++ Vr € [0,1]. 

Note that, for < e < 1/4 and SI = (e, +oo), hess f 2 is bounded on SI and therefore grad/2 
is Lipschitz on Q. We denote by L the constant of Lipschitz of grad/2. From the last equality 
hess x t) is also bounded on and grad^ (/?(., r) is Lipschitz on with constant L T = tL for all 
r G [0, 1]. Besides, sup rg [ 01 ] L T = L < +00. 

We claim that f(x) = max^i^ fj(x). Indeed, note that f2(x) — fi(x) > for x € (0,1), 
f2(x) — fi(x) < for x G (l,+oo) and /i(l) = /2(1)- Thus the affine function [0, 1] 9r 4 ip(x,r) 
satisfies 

max re[0il]V ?(x,r) = i 

[f2{x), x 6(1, +00). 

and the claim follows. With that characterization for / all assumptions of Theorem 14 . 1 1 are verified, 
with q = 5/16, c = /(3/4) and <5 = 2/5, see Example in [2]. Hence, letting x° 6 K++ and A > such 
that x° 6 Lf(f(q)) and L < \i < Afe < A, the proximal point method, characterized in Theorem l4.H 
can be applied for solving the above nonconvex problem. 

Remark 5.1. Function f(x) = max T ip(x , t) , in the above example, is nonconvex (in the Euclidean 
sense) when restricted to any open neighborhood containing its minimizer x* = 1. Therefore, the 
local classical proximal point method (see [75]/ ) cannot be applied to minimize that function. Also, 
as f is nonconvex in the Riemannian sense, the Riemannian proximal point method (see 11 If ) can 
not be applied to minimize that function, see Example in JM/ for more details. 

6 Final Remarks 

We have extended the application of the proximal point method to solve nonconvex optimization 
problems on Hadamard manifold in the case that the objective function is given by the maximum 
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of a certain infinite collection of continuously differentiable functions. Convexity of the auxiliary 
problems is guaranteed with the choice appropriate regularization parameters in relation to the 
constants of Lipschitz of the field gradients of the functions which they compose the class in 
subject. With regards to the Theorem 14.11 in the particular case that (p(.,r) is convex on Q for 
r £ T, convexity of the auxiliary problems is guaranteed without need of restrictive assumptions 
on the regularization parameters. Besides, as observed in Remark 14.21 the additional assumptions 
h2 and h3 are satisfied whenever Q is bounded. 
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